Difference Sequence Compression of Multidimensional Databases

نویسنده

  • István Szépkúti
چکیده

The multidimensional databases often use compression techniques in order to decrease the size of the database. This paper introduces a new method called difference sequence compression. Under some conditions, this new technique is able to create a smaller size multidimensional database than others like single count header compression, logical position compression or base-offset compression.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Difference-Huffman Coding of Multidimensional Databases

A new compression method called difference-Huffman coding (DHC) is introduced in this paper. It is verified empirically that DHC results in a smaller multidimensional physical representation than those for other previously published techniques (single count header compression, logical position compression, base offset compression and difference sequence compression) . The article examines how c...

متن کامل

On the Scalability of Multidimensional Databases

It is commonly accepted in the practice of on-line analytical processing databases that the multidimensional database organization is less scalable than the relational one. It is easy to see that the size of the multidimensional organization may increase very quickly. For example, if we introduce one additional dimension, then the total number of possible cells will be at least doubled. However...

متن کامل

Efficient Pattern Matching of Multidimensional Sequences

We address the problem of the similarity search in large multidimensional sequence databases. Most of previous work focused on similarity matching and retrieval of one-dimensional sequences. However, many new applications such as weather data or music databases need to handle multidimensional sequences. In this paper, we present the efficient search method for finding similar sequences to a giv...

متن کامل

DNA Lossless Differential Compression Algorithm based on Similarity of Genomic Sequence Database

Modern biological science produces vast amounts of genomic sequence data. This is fuelling the need for efficient algorithms for sequence compression and analysis. Data compression and the associated techniques coming from information theory are often perceived as being of interest for data communication and storage. In recent years, a substantial effort has been made for the application of tex...

متن کامل

Sparsity Handling and Data Explosion in OLAP Systems

A common problem with OnLine Analytical Processing (OLAP) databases is data explosion data size multiplies, when it is loaded from the source data into multidimensional cubes. Data explosion is not an issue for small databases, but can be serious problems with large databases. In this paper we discuss the sparsity and data explosion phenomenon in multidimensional data model, which lie at the co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1103.3857  شماره 

صفحات  -

تاریخ انتشار 2004